/o v p ^ 

W6 2 3W»%) 



SUBSTITUTE SEQUENCE LISTING 



^J^AO^t GENERAL INFORMATION: 



(i) APPLICANT: 
JOHN JOSEPH OKULEV 



JONATHAN EDWARD LIGHTNER 



71 TLE OF INVENTION: GENES FOR MICROSOMAL 
FATTY ACID DELTA- 1 J. 
DESATU RASES AND RELATED 
ENZYMES FROM PLANTS 



(iii) NUMBER OF SEQUENCES: 61 

(iv) CORRESPONDENCE ADDRESS: 

(A; ADDRESSEE: F. Z . DU PONT DE NEMOURS 

AND COMPANY 

(B) STREET: 100/ MARKET STREET 

(C; CITY: WILMINGTON 

(D; STATE: DELAWARE 

{ E ) COUNTRY : U.S.A. 

(F) ZIP: 1 9-r-E*e 

(v) COMF UTER REAE'ABLE FORM: 

(A; MEDIUM TYPE: Fi oppy disk 

COMPUTER: M-aoIntosr. 
( 0 : O z D PAT 1 1 ] ■ Y S T F,M : Ma c 1 1 r_ o t h 0.0 
(D E O FTW AR.i : lY'^Ociji" Wv^ 4.0 

(vi) CURRENT APPLICATION DATA: 

(A) A? PL I CAT DON NUMBER : 
(Pi FILING DATE: 

( o ■ class if: cA/r i or J : 

;vii) FPTiP: APPLICATION DATA: 

(A APr LIGATION NUMBER: U.S. 07/97 7,339 
(B FILING ['ATE: 1 ' - MOV- I ; 1 H 0 



(viii - ATTORNEY .'ACCENT INFORMATION: 

(A- NAME: S • :«"• 1 1 , ZarY.ar-z 0. 

{ B ; REGINTFATIuN NUMBER: ;.0,684 

(C : REFERENCE/ DC 'v^ET NUMBER: BB-1 043-B 



iixi T LLE( 'i MMUNI 0 ATIi jN INFORMATION: 

(A; TELEPHONE: 3<-0} 9^2-4^37 
( B ) TELEFAX : 3><3 s 9i -7 9 4 m 



(A) LENGTH: 13'-- Joase pairs 
(C) 3TPANDEDNESN : double 

,U] MOLECULE TYPE: cDNA 



1 



(iv) ANTI-SENSE: NO 

(vi) -ORIGINAL SOURCE: 
(A) ORGANISM: Arat idopsis thaliana 

(vii) I MMEE) 7 ATE 30Vr.CE : 
{31 SI.- NE : d32K.~ 



(lx) EEATURE 

(A) NAME /KEY 

(B) LOCATION: . 124 4 

(xij SEQUENCE DESCRIPTION: SEC ID NO : 1 : 
AGAGAGAGAG ATTCTGCGGA GGAGCTTOTT CTTCGTAGG' i TGTTCATCGT TATTAACGTT 



6 0 



ATCGCCCCTA CGTCAG'^TCC ATOTO'CAGAA A: ATG G5T GCA GGT GGA AGA ATG 

Meo Gly A La GLy GLy Arg Met: 
I 5 



CCG GTT OCT AC T TCT TCC AAG AAA TC 0 GAA AGO GAG ACC ACA AAG CGT 
Pro 7a L Pro Thr S-r Ser Lys Lys Ser Glu Thr Asp Thr Thr Lys Arg 
10 1= * 2 0 



161 



GTG CCG TGC GA 0 AAA CCG OCT TCC TC 2 GTG GGA GAT CTG .AAG AAA GCA 
7aL Pro Oys Gla Lys Pro Pro Pne Ser Val Gly Asp Leo Lys Lys Ala 
2 5 : 3 0 



lie Pro Pro His Cys Pne Lys. Arg Ser lie Pro Arg Ser Pne Ser Tyr 

4D 4=- 5 'i 5 5 



:tt atc agt 



ATT ATA GOG TCA TGC TTC TAG TAO 



Leo lie Ser Asp lie lie Tie Ala Ser Cys Ph- Tyr Tyr 7a 1 Ala Thr 

5 



7 ' 



3 Co 



AAT TAG TTG TCT CTC CTC OCT CAC CCT CTC 



- r LJ u l, 



eo s 1 ro G i_n ? ro Lo a 



TAG TTG GCT TGG CCA 
Tyr Leu Ala Trp Pro 
35 



;t caa ggc tgt g 



-L - O 1 



e a Tyr Trp Ala Cys Gin Gly Cys 7a 1 

9 J 1 5 



GGT AiC i GG GTO' ATA 
Gly I le Trp 7a 1 lie 

1 D J 



4 01 



;ac: gaa rcc ggt gac 



■ ... M. 1 _. ' JJ ' 



:a tt. agc 



tag caa 



TCP TGG AA2 TAT A CAT 
140 



:,t cac cat to: aac a:t 



1 4 5 



135 

t:c cto 



GAA AG A GAT GAA GT 



^ ill 



:CA AAG CAG AAA T'"A CCA ATC AA f " 



1 



TAG GGG AAA TAG GTG AAZ AA 2 CCT 2TT 3GA GGG ATC ATG AT G TTA ACC 
Tyr Gly Lys Tyr Leu Asn Asn Pro Leu Gly Ar-g lie Met Met Leu Thr 



1 



_ 8 r> 



64 1 



gt: -:a:; ttt gt: gtg ggg tgg 

VeU jLri Pne V a I Lou G 1. y Trp 
1 3:. 190 



TTG TAG TTA 3CC TTT AAC GT 2 TGT 
Leu Tyr Leu Ala Pne Asn Val Ser 
19-;. 



0 9 



AG A CCOi TAT GAG 



TTC 



Gly Ari; Pr:> Tyr A.-:-;p jly Phe AU 



? n t 



tgg :at tt: ttc cog aag ggt ccc 

Cys His Pne Pne Pr:» Asr. Ala ?r:» 



210 



2 1 1 



AT 1 :: TAG AAT GAG 
lie Tyr' Asn Asp 



;a gaa ggo: gtg cag ata tag gtg :gt gat ggg ggt 

Hy 



Aro Gl.u Arg 
2 s 0 



Gin Tie Tyr Leu Ser Asp Ala 



ATT GTA GOG GTG TGT TTT G GF GTT TAG GGT TAG GGT GGT GGA GAA GG2. 
1 l.e Leu Al h Va 1 0 



Phe GLy Leu Tyr- Aiu Tyr Ala Al,. Ala Gin 
2 4 0 2 4: 



ATG GGG TCG A TO- ATC TGC GTG TAG GGA GTA CCG GTT CTG ATA GTG AAT 



Mot Al w Ser Me- I le Cy: 

? r :": 



u Tyr Gly Vol Pro Leu Leu He Val As: 



GGG TTC 

■ m \ 1 - O - ,r : 



gtc ttg atc a:t tag ttg CAt; 



AOT GAT CG'" tgg T ] 



j. i *z' i . i i > r 



n ri i 5 rnr i 



GGT CAC TAG GAT 
Pr:> Hi.: Tyr Asp 

J -: :"l 



tca gag tgg gag to;-:; gtc agg gga ggt ttg gc: 

: r > Asp T r p Leu Ar<g G 1. y A 1. 3 L e ' i A 1 ; 



ACC GTA GAO- -GGA GAG TAG GGA A 
Thr Va i A:e p Ar g Asp T 



TTG AAG AAG GTG TTG GAG AAG ATT 
Gly lie Leu Asn Lye Val Pne His Asn He 



1UL 0 



ACA GAG AO A CAC GTG GGT GAT CAC CTG ':TG TCG ACA ATG "CCG- CAT TAT 
Thr Asp Thr- His V* 1. A La His Hls Leu Phe Ser Thr Met Pro His Tyr 



AAC GGA ATO- GAA GGT ACA AAO 
A.,u Ale M • G A . . i Tin ^yo 

3 .: C. 



VTA AAO! CCA ATT CTG GGA GAG TAT 

T. , v. 



O 1 V h;~rj 



34 C 



112 1 



,j ^ .-\ j. ij i n i AVjio O/a'j I JJ , _-*" 



/S 'jj± U G'. 



lie Tyr Val Glu Pro Asp Arg Glu Gly Asp' Lys Lys Gl'j 



'yr Trp Tyr Asn Asr 



CCATTTTGTT GTGTTATGAC ATTTTGGCTG CTCATTATGT T 1372 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: i 8 3 ammo acids 

(B) TYPE: amiriD acid 
i D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

'xil SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gly Ala G . y G-ily Arg Met Pru Vai Pro Thr S^r Ser Lys Lys Ser 
1 5 10 15 



Liu Thr A;E p Thr Thr Lys Arg Val Pro Cys 3Iu Lys fro Pro phe Ser 
2 0 2 5 " 3 0 



Val Gly Ajp 



eu Lys Lys Ala lie Pre. Pro His Cys Phe Lys Arg Ser 
4 1 4 5 



LLe Pro Arg Ger Phe Ser Tyr 

5 5 



Asp I.e lie lie Ala 



:ys Phe Tyr m '/r Ya.l Ala Thr Asr. Tvr ?h^ S--r L*o; Leu Pro Gin Pro 



5P5 0 1 1 9 5 

Leu Thr Gly I.;, e Trp Val lie Ala His Gia Cys G.i y His His Ala Phe 



Ser Asp Tyr Gin Tip Lou Asp Asp Thr Val Gly Leu lie Phe His Ser 

i:5 12 0 125 

Phe Leu Leu Val Pro Tvr Phe Ser Trp Lys Tyr Sor His Arg Arg His 

1 3 L 13 5 11C 



H:s Ser Asn Thr 
LL5 



Ana Asp Glu iVM Phe Vai Pr^: 



1 o Li 



::r Ala lie Lys Trp Tyr G ... y Lys Tyr Leu Asn Asn Pro Leu 



_s Phe Phe 



2 0 0 2 0 5 

As n A 1 a P r o lie T y r As n Asp A r a G 1 u Arg Leu G 1 n 



Tyr Leu Ser Asp Ala Gly lie Leu Ala Vai Cys Phe Gly Leu Tyr 



/Org 



Ala AM 



Ma Ser- Met lie Cys Leu Tyr Gly 
4 



2 50 



2 55 



Val Pro Leu Leu lie Val Asn Ala Phe Leu Val Leu lie Thr Tyr Leu 

2 o ■ > 2 6 5 2 7 0 

Gin Hi-. Thr His ?r:> Ser Leu Pr His Tyr Asp> Ser Set Glu Trp Asp 

275 2 35 

Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly lie Leu 



Asn Lys Val Phe Mrs Asn He Thr Asp- Thr His Val Ala His His Leu 
3 0 5 3 10 31 5 3 2 0 

Phe Ser Thr Met Pro His Tyr Asn Ala Met Glu A La Thr Lys. Ala He 
^2: 3 30 335 

Lys Pro lie Leu Giy Asp Tyr Tyr Gin Phe Asp* Gly Thr Pro Trp Tyr 
'•A 1 ' 3 4.: 350 

Val Ala Met Tyr Arg Glu Ala Lys Glu 3ys He Tyr Val Glu Pro Asp 

35 5 3*0 3 65 

Arg Glu Gly Asp> Lys Lys Gly Val Tyr Trp Tyr Asn Asn Lys Leu 

37 0 37 5 33 0 



. Mr ORMATT ON PC)?. 



r it, -j rn 



-A) LENGTH: 1426 5 .o> pairs 

■ B i TYPE: nuclei c soid 

< C i STPAN jEDNESC : single 

■ D j TOPOLOGY : I i nea r 

hi) MOLECULE TYPE;: cDNA to mRKA 

- on ) HYPOTHETICAL: NO 

- vr ) < 'K:: G I NAL SOURCE : 

i A 1 OFO -AN I 2-1-1 : Rras^ica napus 

i ixi EEATC'RE: 

< A ) NAME EEY: CD.: 

'xi! SEQUENCE DESCRIPTION: SEO 10 NO : 3 : 



.'A'oa:.: 1 



TT CTTCG TAG O-u-T": OATOGi' TAY'lVYACGiT AAATOTT^AT CCCC^'VTAC GTCAGCCAC-C 120 
TCAAGAAAC A'l G ( :GT GCA GGT GGA AGA ATG CAA GTG TCT OCT CCC TCC 168 

i : 10 

.AAA AAG TCT GAA AGO GAG AAC ATG AAG GGG GTA CCC TGC GAG ACA CCG 216 
Lys Lys Ser Glu Thr Asp Asn lie Lys Arg Val Pro Cys Glu Thr Pro 



Pr:> Phe Thr Val Gly Glu Leu Lys Lys Ala lie Pro Pro His Cys Phe 

3-- 3 5 4 0 4 5 

AAA GGG TGG ATC GGT CGO: TOT TTC TGG TAC GTG ATC TGG 3AC ATC ATC 312 

Lys Arq Ser He Pro Aru Ser Pne Ser Tyr Leu He Trp Asp He lie 



at;, ggo tgg tgc tt: tal tag gtg ggg a:g act tag ttg lgt 3T3 ctc :-.60 

II- ■ Al.i Ser Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Pne Pro Le a Leu 

>0- 7 0 7 L . 

GOT CAO: GGT GTG TGC TAC TTG G 3C TGG GGT GTG TAG TGG GCO' TGG GAG 4 01- 

?r--. His Pro Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala 3ys Gin 
5 0 : 3 3 ■ j 

GGG TGG GTG GTA AGG GGG GTG TGG GTG ATA GGG GAG GAG TGC GGG GAG 4.56 

Gly Cys Val Leu Thr Gly Val Trp Val. lie A .. a His Gl.u Cys Gly H is 

10-: 10 5 

CA-"* 3CC TTG AGG GAG TAG GAG T 3G GTG GAG GAG ACC GTC 3G 5 3TC AT 3 !: 0 4 

Hi:?. Ala Phe Ser Asp Tyr Glr. Trp Leu Asp Asp Thr Val Gly Leu He 

11" 115 l..u 125 

TT-: 3AC TGG TTG GTG GTG GTG C 3T TAG T TG TGG TGG AAG TAC AGT GAT 55.2 

Pro"- His 3er Phe Leu Leu Val Pro Tyr Pne S-r Trp Lys Tyr Ser His 

13: 135 i r 

14 5 1 5 0 1 5 5 

GT 1 CCC AAG AAG AAG TCA GAG ATC AAG TG 5 TAG GGG AAG TAG GTG AAG 64 i- 

Va : Pro Lys Lys Lys Ser" Asp Lie Lya Trp Tyr GLy Ly? 

1 6 0 16 5 1 7 0 



AAV 3GT TT 3 GGA CGO* AO: 3 GTG ATG TT A AGG GTT GAG TTG A0:T 



'ro Le j Gly Aru rnr Val M-eo Leu Thr 
7 : 1 i -.. 



.ii -je ^ 1 r ■ 



tgg :gt ttg tag tta go: :: ttg aag gtg tgg ggg aga got tal gag ggg 

T r: • ?r o Le i Tyr' Leu A 1 ~ rue Aon Val 3er G _ y Aru: Pro Tyr Asp Gly 

1 ;) 1^5 2 ' j 205 



GGg ttu iOOT TGG 



•A,^ .., 'JT.-V- 



:; h-r Ala Gys His Phe H : Pro Asm Ala no lie Tyr Asn Aap Ar j 



,A Oi l' GGG TGG ATG GTG £&y 



y 



TA3 GGA GTT GO I 1 GTT GTG ATT GTG AAC GG 3 TTC TTA 3 TT TT 3 
Phe Tyr Gly Val Fro Lea Leu lie Vhl Asn Gly Phe Lea Val Leu 

^5 2 60 2b5 



6 



ATC ACT TAG TT 3 CAG CA 3 A ."• : 3AT CCT TCC CTG CCT CAC TAT jAC TCG 93 4 

He Thr Tyr Lea Gin His Tnr His Pro 3er Leu Pro His Tyr Asp Ser 

27 0 27 c 2 8 0 28 0 

TCT GAG TGG GAT TGG TTG AGG GGA GCT TTG GGG ACC GTT GAG AGA GAC 10 32 

Ser Glu Trp Asp T rp Oe Arg Gly Ala Lei Ala Thr 7a L Asp Aru Asp 

2 V- 2 9^ 30 0 

TAG GGA ATC I"IG AAC AAG GTC TTG CAC AAT ATG AGG GAC ACG CAC GTG 108 i 

Tyr Gly lie Leu Asn Lys Val Phe His Asn He Thr Asp Thr His Val 

3:0 31-; * 3 1 = 

GGG GAT CAC GTG TTC TCG ACC ATG CCG GAT TAT GAT GCG ATG GAA GCT 1 1 2 

Ala His His Leu ?ne Ser Tnr Me: Pro His Tyr His Ala Met Glu Ala 

32 0 32 0 3 30 

ACG AAG GC3 ATA AAG CCH ATA CTG GGA GAG TAT TAT GAG TTC GAT GGG 117i-i 

Thr Lys Ala He Lys Pro lie Leu Gly Glu Tyr Tyr Gin Phe Asp Gly 

3 30 3 4 J 34 0 

ACG CCG GTG GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATG TAT 1.0? 4 

Thr Pro Val Val Lys Ala Met Trp Arq Glu Ala Lys Glu Cys He Tyr 

300 30 5 360 300 



GTG G/yA CCG GAO AGG 0!AA GGT GAG- AAG .AAA GGT GTG TTC TGG TAG AAG 
Val Giu Pro Asp Atu G i r. Gly Glu Lys Lys GLy Val Phe Trp Tyr Asn 



"3 1 : 



Asn Lys Leu 

TTGTT7AAGA AGO: I ATG I TO' CTGTTTCAAT AATCTTAATT ATCCATTTTG TTGTGTTTTC 1 -0 4 
TGACATTTTG GOTAAAATTA TGTGATGTTG GAAGTTAGTG TC 1426 
(2) INFORMATION E< >R 3E0 10 NO) : 4 : 

(1) SEQUENCE CHARACTERI3T ICS : 
(A) LENGTH: 384 amino acids 

B ) TYPE.: anino ic id 
( D) TOPOLOGY : linear 

M i \ r Ar^ - it^mt r f T- v - p-7 . ^ ^ .-. - -_ . ~ . 

ixO SEQUENCE DESCRIPTION: 0EC ID NO : 4 : 



'a 1 G2 y Glu Leu Lys Lys Ala lie ?r s Pro His Cys rhe Lys Ar-g Ser 



le Pro Arg Ser Phe Ser Tyr Leu lie Trp Asp lie lie He Ala Ser 

5 0 -0 0 0 



7 



Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro 

7 0 7 5 3 0 

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gin Giy Cys Val 

3 5 90 9 5 

L- 1 . Th r Giy V.i.l Trp Val lie Ala His Gl a Cys Gl.y rirs His Ala Pne 

1 J j 105 llf 

3nr Asp Tyr Gin Trp Leu Asp Asp Thr Val Giy Leu Tie Phe His Ser 

115 12 0 12 5 

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser Hi.s Arg Arg His 

1 3 0 135 14 0 

His Set Ann Thr Giy Ser Leu Gl.u Arg Asp* G.u Val Phe Val Pro Lys 

115 ]50 1 L .5 IriO 

Lys Lys Ser Asp lie Lys Trp Tyr Giy Lys Tyr Leu As.h Ash Pro Leu 

16 5 5 7: 175 

G 1. y Arg Thr Val Met Leu Thr Va.l Gin Phe Thr Leu Giy Trp. Pro Leu 

110 16 5 1 90 

Tyr Leu Ala Phe Asn Val Ser Giy Arg Pre- Tyr Asp Giy Giy Phe Ala 

1 b 2 0 0 2 0 5 



G . r. lie Tyr- lie Ser Asp Ala Giy lie Lea Ala Val Cys Tyr Giy Leu 

2 . = 2 .3 5 2 .: t 2 AC 

Tyr Arg Tyr Ala Ala Val Gin Giy Val Ala Ser Met Val Cys Phe Tyr 

2 4 5 2 5- 0 2 5 5 

G...y Val Pro Leu Leu lie Val Asn Giy Phe Lf-u Val Leu He Thr Tyr 

201 265 270 

Leu Gin H.; s Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp; 



L^u Arg Giy Ala Leu Ala Thr Val Asp Arg Asp Tyr Giy He 

2 9 5 3 0 C 

Lys Val Phe His Asn He Thr Asp Thr His Val Ala His His 



. . >. ..s/s i- :: .. 1 leu . ; 1 y ul. 1 yr Tyr Gj.ii Piie Asp* G^y Thr Pro Vai 

3 4 0 ^4 5 v,.-. 

Val Lv =? AH Mp- Trp Arg n - l \: Ala Lys Glu Cy 0 1H T / r V- \ G i .: ire 

315 360 365 

Asp Arg Gin Giy Glu Lys Lys Giy Vai Phe Trp Tyr' Asn Asn Lys L*.-u 



8 



(2) INFORMATION FOR SEP I D NO : 5 



( i ) SEQUENCE CHARACTER I ST ICS : 
(A) LENGTH: 14*.?. base pairs 

■ B ) TYPE : nuclei ■?. acid 

■ C) 5 TPY'TIPEDNESG : double 



PY-LEC^LE TYPE: -DMA 
in) HYPOTHETICAL: NT) 
ii v) ANT I - SENSE : NO 



Oi-.IGINAL PO"PC 
ORGAN! SM: G 



:ir.e max 



vvii) IKMEL1ATE SOYRCE : 
' B ) CL'OJE: pSPl-lPS?' 



(ix) PEAT : J R E 
-A) NAME >' KEY 

•,B) ly :atio:j 



(xl) PEO'JENOE PES?EIPTION: 3EQ ID NO : 5 : 



tcgaat/lata caagccacta ggcatgggtc tagcaaagga aagaaca atg gga ggt 

Met Gl y G Ly 



116 



AGA GGT OCT GTG GOC AYA GTG ?.AY GTT CAA GGG AAG AAG GGT CTC TOA 
Arg Gly Ara V.il A'.. a L ;,*.-? Yu I jIu Val Gin Gly Lys Lys Pro Leu Ser 



164 



^rQ Y a j. 
2 0 



•n Thr Lv 



:'CA TTC ACT GTT GGC CAA CTC AAG AAA 
Pro Phe Thr Val Gly Gin Tea Lys Lys 
30 *T. 



GCA ATT 
Ala lie 



TTT CAG CGC TCC CTC CTC ACT TCA TTC TCC 
Phe Gin Arg Ser Leu Leu Thr Ser Phe Ser 



2 6C 



TA 



1 1 j: i I 'J 



TT TAT GAC CTT 



GCC TTC ATT TT 1 " TAG ATT 



ATC TAT TG 
lie Tvr Tr 



YT C T ( 

j 1 T cr,i 



;cr yt 



35 



4 0 4 



a:t CAC GAT 



YT GGT CAC CAT C<; 



YAA Tf 



GTT GA 



■J AT GTT GTG GGT TTG ACC CTT CAC TCA ACA CTT TTA jTC COT TAT TTG 500 

Asp Val Val Gly Leu Thr Leu His Ser Thr Leu Leu Val Pro Tyr Phe 

12 0 10 5 130 

TCA TGG AAA ATA AGC CAT CGO CGC CAT CAC TCC AAC ACA GGT TCC CTT 5-5 8 



nis n i. 5 as:, inr 'j[y ber 

14 0 14= 



GAC CGT GAT GAA GTG TTT GTC CCA AAA CCA .AAA TCC AAA GTT GCA TGG 0 06 

Asp Arg Asp Glu Vu 1 ?ne Val. ?r Lys P .r j Lys Ser Lys Val Ala Trp 

if o 15= i o : 



TTT TCC AAG TAG TTA AAC AAC CCT CTA GGA AGG GCT GTT TCT CTT CTC 64 4 

?ne Ser bys Tyr' Leu Asn Asn Pro Leu Gly Arg Ala Val Ser' be s. Leu 

1 0 ; : 17.; 

GTC ACA CTC ACA ATA GGG TGG CCT ATG TAT TTA GCC TTC AAT GTC TCT Gj.Z 

7a 1 Thr Leu Thr lie G 1. y Trp Pro Met Tyr' be ^ Ala Phe Asn 7a 1 Ser 

ISO 17 0 1^0 137 

GGT AGA ZCC TAT CJAT ACT TTT GCA AGC CAC TAG CAC C :T TAT GCT CCO 74 0 

Gly Arg Pro Tyr Asp Ser Phe Ala Ser His Tyr His Pro Tyr Ala Pro 



\ T 1 r^'T^."' -n 



i brt l ' i ! 1 OL i 

r A s p a i .-A i =i 



TT 7 TTT rCT GTG ACT TAG TCT CTC TAG CGT GTT GCA ACC CTG AAA GGG 6 3 6 

b e u Pn e S e r 7a 1 Tn r T y r Ser b e ^ T y r Ar g 7 a i Ala Thr Leu L y s G 1 y 

TTG GTT TGG CTj 'i:TA TGT GTT TAT GGG GTG CCT TTG CTC ATT GTG AAC 6 :f 4 

Leu Val Trp beu beu Cys Val Tyr Gly Val ? r :■ Leu Leu Tie Val Asn 

2 4: , 0 • 2 7 : 

GGT TTT CTT GTG ACT ATC ACA TAT TTG 37 ; CAC ACA CAC TTT GCC TTG 9:>2 

Gly Phe Lea Val. Tr.r lie Thr Tyr be i G ] n H i. s Thr His Phe Ali beu 



cct cat ta;: gat tca tca gaa tgg gag tcc ctg aag g^a gct tt«: 

Pre H i ■= Ty- Asp --r "71 u sOrr. Asp Trp 7 , . byi: 2G y AIo „e . 

b • 1 2 8 7 2 9 .. 



?hr Me r Asn A r i 



A^ A^i Hl.£ Hii be j Phe Ser Thr Met t'ro His Tyr 



His Ali Met 71 j Ala Thr Asn Ala lie Lys Pro lie Leu 71y Glu Tyr 

3 2 0 3 3 0 3 3 0 



10 



Tyr Gin Phe Asp Asp Thr Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala 
34 0 345 350 355 

AG A GAG TGC GTG TAT GTG GAO; CCA GAT GAA GGA ACA TCC GAG AAG GGC 1220 
Ara Glu Cys Leu Tyr Va 1. Glu Pro Asp Glu Gly Thr Ser Glu Lys Gly 
3ii0 365 370 

GTG TAT TGG TAG AGG AAC AAG TAT TGATGGAGCA ACCAATGGGG CATAGTGGGA 1274 
Val Tyr Trp Tyr' Arg Asn Lys Tyr 

GTTATGGAAG TTCTGTOATG TATTAGTACA TAATTAGTAG AATGTTATAA ATAAGTGGAT 133 4 

TTGCCGCGTA ATOWACTTTGT GTGTATTGTG AAACAGCTTG TTGCGATCAT GGTTATAATG 13 94 

T AAAAA T AA r I ' TCC 3GTATTA ATTACATGTG GAAAGTGTTC TGCTTATAGC TTTCTGC OTA 1454 

AAAAAAAA 14 62 
(2) INFORMATION FOP GEO ID MO : 6 : 

(i) SEOGEMCE CHAPArCTEP I ST I CS : 

(A) LENGTH : 370 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: 1 inear 



(xi) GEyGENCE DESOOP I PT I ON : SEQ ID N'O : 6 : 

M»->: Gly Gl.y Aig G..y Arg Val Ala Lys Val Glu Va L Gin Gly Lys Lys 

1 5 1 0 15 

Pre. Lea Ser Arg Val Pro Asn Thr Lys Pro Pro Phe Thr Val Gly Gin 

2 : 3 5 3 J 

le. Lys Lys- A 1 = I.e Pre Pro His Cys Phe Gin Arg Ser Leu Leu Thr 

3 5 4 5 4 5 

Sec Pne Ser Tyr Val Val Tyr Asp Leu Ser Phe Ala Phe lie Phe Tyr 

5 3 5 : 60 

He Ala Thr Thr Tyr PY* u i~ Leu Leu Pre Glu Pro Phe Ser Leu ire 

6 r " 7 0 7 5 :0) 



Trp 'hi Asr Asp Val VG 



Pro T/r Phe Ser Trp Lys lie Ser His Arg Arg His His Ser Asn Thr 
1 3 0 1 3 5 1 4 3 



15 5 1 6 0 

1 1 



Val Ala Trp Phe Ser Lys Tyr Leu Asn Asn Pro Leu Gly Arg Ala Val 

165 170 175 

Ser Leu Leu Val Thr Leu Thr lie Gly Trp Pr:< Met Tyr Leu Ala Phe 

13 0 18 5 1 ro 

Asn Val Ser GLy Arc Pro Tyr As p Ser Phe Ala Ser His Tyr His Pro 

105 2 00 205 



Tyr Ala Pro I Le Tyr Ser Asn Arg Glu Arg Leu Leu lie Tyr Val Ser 

210 215 * 22 0 

Asp Val Ala Leu Phe Ser Val Thr Tyr Ser Leu Tyr Arg Val Ala Thr 

225 210 2l: ~ 24: 

Leu Lys Gly Leu Val Trp leu Leu Gys Val Tyr Gly Val Pro Leu Leu 

2 4 5 2 51 25 5 

lie Val Ai-n Gly Phe Leu Val Thr lie Thr' Tyr Leu Gin His Thr His 

2 6 3 2 65 2V 7 0 

Phe Ala Leu Pro His Tyr Asp Ser Ser Glu Trp Asp Trp Leu Lys Gly 
2 7 5 2 >: '": 2 8 5 

Ala Leu Ai^i Thr Met Asp Arq Asr Tyr' Gly Tie Leu Asm Lvs Val Phe 



His His lie Thr Asp Thr His Val Ala His His Leu Phe Ser" Thr Met 
5 0 5 31 1 2 1 5 1 2 0 

Pro His Tyr :i . ,. Ala Met Glu Ala Thr Asr. Ala lie Lys Pro lie Leu 

325 335 315 

Gly Giu Tyr' Tyr Gin Phe Asp Asp Thr Pro Phe Tyr' Lys Ala Leu Trp* 
i : -r > 34 5 31 0 

Arg Glu Ala A; : Glu Cyi Leu Tyr Val Glu Pre Asp» Glu Gly Thr Ser 
.3 : Q - 3 m~ 3 6^ 



;iu Lys Giy ''al Tyr Trp Tyr Arg Asn 



: "7 c: 



\NT1-SENSP : NO 



12 



fvn) IMMEDIATE SOURCE: 
(B) CLONE: pFad2#l 

(ix) FEATURE: 
(A) NAME /KEY: CDS 
( 3 } LiOCAT I i')N : I 6 5- . . 1 3 2 b 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

CGGCCTCTCC CCTCCCTCCT CCCTGCAAAT CCTGCAGACA CCACCG0TC3 TTTTTCTCTC 60 

CCCGACAGCA G AAAA 2 GGG A GAGA3AGGIG AGGCGCGGD3 TCCGCCCGAT CTGCTCTGCC 120 

COGACGCAGC TGTTACoACC TCCTCAGTCT CAGTOAGGAG CAAG ATG GGT GCC GGC 176 

Met ]ly Ala jiy 
1 

3 30 AG j ATi j ACC 33-3 AAG GAG :GG GA 3 AAG 3A 3 GAG TAG CT 3 GCC CZA 2 34 

Gly Arg Met Thr Glu Lys Glu Ar? Glu Lys 3 1 n Glu G L r. Leu Ala Arg 

5 10 15 2 0 

GOT ACC GGT GGC GCC GCG ATG :AG CGG TCG CCG GTG GAG AAG CCT CCG 27 2 

Ala Thr Gly Gly A, a Ala Met 31h Arg Ser Pro Val Glu Lys Pro Pro 

. 5 3 0 ?'-. 

IT 3 ACT OTG 02 1 70277 AT 2 AO 3 2 A GCC AT "J 230G OCA 'A/ ' TGC IT GAG u 

4 0 4 5 5 0 

0 3: TO 3 GTG 3TC AAG TO Z TV Z DO 3 TAG GT 3 GTG CAC GAO: 3TG GTG ATC 068 
Arg 3er Val Leu Lys 3er ?ne Ser Tyr Val Val His Asp Lea Val lie 

5 5 60 6 5 

3 3 3 GOO GCG 3TC CTC TA3 TTC 3C 3 CTG GCC ATC ATA CCG GO 3 CTO C 3A 416 

Ala Ala Ala Leu L-u Tyr Phe Ala Le a Ala lie lie Pro Ala Leu Pro 

7 0 7 5 3 5 

AGO CCG CTC CGO TAC GCC GCC TGC CCG i3TG I AC TGG AT'3 30 3 CAC G 30 4*7 4 

3er Pra Le j Ar g Tyr' Al 3 Ala Ire ?r^ Le > I'yr Trp lie Ala Gin Gly 

i 2 707 9 5 107 j 

1 37" GT - : TC'" 1 A70 3' 73 GT 3 TO 3 3T 3 AT 3 00 3 3AO '.A'_i i 'j'Z -j 30 CAC CAC 1 .7 
2 y 5 .'a 1 Gys Tr. r G 1 y 2a 1 TO p ^ a 1 1 :_e A i. a -is Glu 3 y s 3 1 y H 1 H l s 

1 : ] 5 110 1 1 5 



.3 I- JJ 232 313 AlO '02 000 T A 2 TT 3 FOG 13 7 AAG TAC AGO 2AC CGG -078 

1 2 5 1 4 3 1 4 2 

7C 2 A (3 OA2 T22 AAC ACG 3GG TOO CTO 3A 3 330 3A2 GAG GTG TT 3 GT 3 63 6 

:q Has His Ser Asn Thr 31y Ser Leu Gl a Arg Asp 31u Val Phe Val 

15 0 152 16 3) 
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Pro Lys Lys Lys Glu Ala Leu Pro Trp Tyr Thr Pre Tyr Val Tyr Asn 
16: 170 175 180 



AAA CTG GTC GGC C2C GTG GTG CAC ATC GTG GTG GAG CTG ACC CTC GGG 752 

A sr. ?:■:■ Val Gly Arg Val Vai His He Val Val Gin Leu Thr Leu G 1. y 

1H 1 105 

TGG CTG GTG TAT' GTG GGG ACT AAG GGG TOG GOG GGG GGG TAG GGG GGG 8 00 

Trp Pro Leu Tyr Leu Ala Thr Asr. Ala Ser Gly Ar j ?r-:- Tyr ?ra Ar-: 

2 0 0 205 " 2 10 

TTC GCC TGG CAC TTC GAG CCC TAG GGG CCC ATC TAG AAG GAG CGG GAG 8 4 l c 

Pne Ala Oy.s His ?ne Asp Pr :> Tyr Gly Pro lie Tyr" Asia Asp Axoj Glu 

CGG GGG CAG ATC TTC GTC TCG GAG GCC GGC GTG GTG GCC GTG CCC TTC 8 9 <;> 

Ar? Ala -3 In He Pne Val Ser Asp Ala Gly Val Val Ala Val Ala Phe 

2:2 255 24 0 

OGG GTG TAG AAG CTG CCC CC2 CCC CTC GGG GTG TGG TGG GT 0 GT 0 CC2 94 4 

31y Leu Tyr Lys Leu Ala Ala AU Phe Gly Val Trp Trp Val Val Ar«? 

2 4 5 2 5 0 5 5 5 T 0 0 

GTG TAG GCC GTG CGG CTG CTG ATC GTG AAG CCC CCC CTC CTC CTC .ATC 9 95 

Val Tyr- Ala Val ?r:> Leu Leu lie Val Asn Ala Trp. Leu Val Leu lie 



rnr Tyr Leu G 1 r. His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser 

2 - 2 2 3 5 2 90 

2 AC To 2 CAC TGG OTA CGG GG T GO GTG GCC ACC AT'- GAG CGG GAG TAG 10c c 

?t!u Trp Asp Trp Le a Ar: Gly A, la Leu Ala Thr Met Asp Aru Asp Tyr 

2 9 ( : :-0 2 2 0 5 

GGC AIC CTC AAC CC2 CTC m T 5 CA 2 AAC ATC ACC CAC ACC CAC GTG CC 2 1150 

Oiy He Les. A;u\ At j OVA. Pne His Asn Tie T'nr Asp Thr His Val Ala 

31 : U5 32 0 

5AC CAC CTC TTC TG T ,-.CC mT j CC - GAG TAG CAC GCC /AT '3 GAG oCC ACC 1.18 4 

HI? His Leu Pne Ser Thr- Met Pr-: His Tyr His Ala Me: Glu Ala Thr 

AAG G'0- ; .-ATC A* • CCC .-U'O CT 2 CC 2 2 AC TAG TAG CAC TTC GsvO CGG AC 2 Izj Z. 

Lys Aia IT-: Ar ; Pr: II- Lea G 1 y Asp Tyr Tyr His Prv: Asp ■ r - Thr 



OA 2 Ceo o AG on-" 

10 3 5 

Ala Pro Or, Asr 



■;P A 



CGGTCGGAGA GCTGAGGACG CTACCGTAGG AATG 3GAGCA 0AAAAGGAGGA GGAGGAC A r 
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GAAGAGAGAT OATTTGGGOA GAGAGACGAA GGCTTACTGC AGTGGCATGG CTAGAGCTGC 1515 

GATOAAGTAO AAGTAGG 5AA ATTCGTCAAC TTAGTGTGTG CGATGTTGTT TTTCTTAGTC 157 5 

GTCGGCTGCT GTA 500T I'TC CGGCGGCGGT OGTTTGTGTG GTTGGCATCC GTG 10GATG0 1635 

OTGTGGGTGO GTG 5*000 5GG TTGTGGTGTG CGTCTGTCGT GGOGTTGGOG TOGTGTGTTO 1695 

GTGCTGCCCG TGTGTTGTTG TAAAAOAAGA AGATGTTT TO TGGTGTCTTT GG0 3GAATAA 175 5 

CAGATCGTCC GAACGAAAAA AAAAAAAAAA AAAAA 17 90 

(2) INFORMATION EOR GEO II. NO : 8 : 

(i.) seouenoe CHARACTERISTICS: 

fA" LENGTH : :-8'7 aitiir.o acids 

■ B i TYPE: amino acid 

f D'i TOPOLOGY : linear 

■li) MOLECULE TYPE: protein 

ixi) SEQUENCE DESORIPTION: SEQ ID NO:.:: 

Met Gly Ala Gly G-y Arg Met Tnr Giu Lys Glu Arq Glu Lys Gin GIu 

L ~ 10 13 



2 0 2 5 3 0 

cj Lys Pro Pr : Phe Thr Leu Gly Gin lie Lys Lys A L a I Le Pro Pro 

5 : 4 -I 4 5 

_s Oys Phe GO. a A:a Rer Va 1 Leu Lys Ser Phe Ser Tyr Val Val His 



Asp Leu '/a! lie Ala Ala Ala Leu leu Tyr Phe Ala Leu Ala lie lie 

6 5 7 0 7 5 8 C 

Pro A .1 ei Leu ? r : Ser P r > -> 'L e u A r q T y r Ala Ala T r p Pro Leu T y r T r p 

-i = 90 ' 95 

lie Alu Gin Gly Cy? Val Cy.=- Thr Gly Val Tr r 'VL II- Ala His Giu 

10 0 15 5 11 0 



1 1 5 



Tyr Ser His Aru Ar t HI * l O ^ Per As;. Thr Oy R-r. V-u 01 u A: j V:_^ 

14 5 15" 155 165 

Giu Val Phe Val Pro Lys Lys Lys GIu Ala Isu Pro Tris 

16 5 17 0 
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Leu Thr Leu 3iy Trp Pro Leu Tyr Leu Ala Thr Asn Ala Ser Gly Arq 

1-5 2 i'i 2 0 5 

Pro Tyr Pre Ar: Phe Ala Cys H.:.s Phe Asp Pre Tyr Gly Pro He Tyr 

21.0 J 15 220 

Asn Asp Ara Glu Arq Ala Gin lie Phe Val Ser Asp Ala Gly Val Val 

2 2 5 2 3 -J 2 3 5 2 4 0 

Ala Val Ala Phe Gly Leu Tyr Lys Leu Ala Ala Ala Phe Gly Val Trp 

2 4:- 2 5 0 2 5 5 

Trp Val Val Aiu Val Tyr Ala Val Pr:> Leu Leu lie Val Asn Ala Trp 

26: 2 7 :j 

Leu Val Leu lie Thr Tyr Leu Gin His Thr His ?r>: Ser Leu Pro His 

27 5 21-::: 28 5 

Tyr Asp Ser Ser G^u Trp Asp Trp Leu Arg Gly Ala Leu Al a Thr Met 

2 90 2 95 30'J 

Asp Arg Asp lyr G_.y lie Leu Asn Arg Val Phe His Asn lie Thr Asp> 

305 5 10 " 315 320 



Leu PL- .Spt Thr Met ?r<: 



V 'C n 1 S Ml a 



Met Glu Ala Thr Lys Ala lie Arg Pro lie Leu Gly Asp Tyr Tyr' His 
;■ 4 : '34 5. 3 5. 0 

Phe Asp Pre Thr Pro Val Ala Lys Ala Thr Trp Arg Glu Ala Gly Glu 
355 3*-0 ' <6 : - 

Cys lie Tyr '/al G...U Pre Glu Asp Arg Lys Gly Val. Phe Trp Tyr Asn 

Lys Lys Phe 
3 3 5 

(2) INFORMATION POP 2EO I D NO: 9: 

■1. ) oF,n U E I ] C. E C - A ~ A C TP P T ST 7 C S : 
LA: LENGTH: 6 7 - base pairs 
(B; TYPP: nucleic acid 



(:v) ANTI-SENSE: NO 

(vi) ORIGINAL SOU F; CP : 
( A ) O RG AN ISM: R i c i n u s c ommu n i s 
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fvii) IMMEDIATE SOURCE: 
(B) CLOIIE: pRF2-lC 



(ix) FEATURE 

(A) NAME /KEY 

(B) LOCATION 



GDI 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



TOO 3TG AT' 
Trp Val. Me 
1 



cat rev Gai; ca: sat ccc ttc ACT CAC TAT CAA 
Hi.s Asp. Cys CLy His His Ala Phe Ser Asp Tyr Gin 



10 



15 



4 8 



TTC CTT CAT 
Leu Leu Asp Asf. 



- r _ i 



CTT GGT CTT ATC CIA CAC TCC TGT CTC CTT GTC 
Val G Vy Leu lie Leu His Ser Cys Leu Leu Val 



9 6 



:T TAT TTT I'CA TO-:; .AAA. CAC AC'" CAT 
o Tyr Pne Ser Trp Lys His 3e 
3 5 4 



iC CCA CAT CAT TCC AAC ACA 
His Arq Arj His H:.s Ser Asn Thr 
45 



14 4 



;i y 



TCC CT'I 
Ser Lei. 



:,AT CAA GT3 TTT 5TT 
_u Ar-.] Asp C.u Val Phe Vnl 



AAG AAG AAA TCT AG I 
L y s L vs Lys S e r Ser 



ATC CGT TCC TAT" TCC AAA TAG CTI AAC AAC OCT 
Lie Ara T r l- T / r Ser Lys F v r Le i Asn Asn ?r :• 



ggt cgt atc at:; 

C 1 y A l [It Me i 



ACA ATT 
rhr I Le 



A1-: 



GT' : ACA CTT TCA CT 1 GGI 
Vs. Thr Leu Ser Le GL\ 



t i;n:; cci 

Trp 



:ta gca tt-: 

.en Ala Ph-i 

9 5 



2 'r 8 



aat gtt 

Asn Va I 



TCA GGO AGs; CCA TAT GAI 



GCC TCC CAC 



;er 51/ Ar g Pi o Tyr Asp Ar j Pne Ala 



10 : 



TAT GAC CO; 
His Tyr Asn Pr-: 
1 1 I 



3 .:■ (5 



'RAT GG : 
Tyr GL; 



AAT IAT 



ati 



Pre- lie Tyr Asr Asp Aro Slu Ar-; lie 

11= 1 2 J 



U.'-V.n 

a i. u 



ATA TTC ATA TCA 
1 Le Pne lie Ser 



3 L : 4 



■jAI an i 
As p A 1 a 
1 ^-"] 



GGT jTT ' 
G 1 y V. 



;CT GTC ACT TTT GGT CTC TAI 



L-u Ala Val Tnr Pr 

1 "V d 



Leu T\ 



1 _ / 

G In 



i"" 'T 1 _Y[i, 

Ala He 



TGG GTT 



1 G L 1 3 



TA TAT 



'A TTC TT\ 



'CT jCA TTG CCA CAT TAT GAT T 3G TCG GAG T3G GAI' TG 5 VIA AGA GGA 



p r o A "i n T 



18 5 



1 5 j 
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Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly lie- Leu Asn Lys Val Phe 
195 2C0 205 

CAT AAC ATA ACG GAC ACT CAA GTA GOT GAG CAC CTT TTC ACC ATG CCC C 

His Asn He Thr Asp Thi Gin Val Ala His His Leu Phe Thr Met: Pro 

2 1 J zlb 2 2 0 



67 3 



(2) INFORMATION FOR SEQ ID NO:L0: 

(i) SEQUENCE CHAP.ACTER I ST I CS : 

(A) LENGTH: 22 4 amino fields 

(B) TYPE: amino -old 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 0 : 

Trp Val Met. A.', a His Asp Cys Gl y His His Ala Phe 3er Asp Tyr Gin 
1 :■ 1.0 15 

Leu Leu Asp Asp Val Val Gly Leu He Leu His Ser Cys Leu Leu Val 



Pro Tyr Phe Ser Trp Lys His Ser His Arg Arg His His Ser Asn Thr 

t A' A z 



lie Arg Trp Tyr Ser Lys Tyr L- 



u Asn Asn Pro Pro Gly Arg lie Me*. 

/ 3 ■: 



Thr lie Al 



Thr Leu Ser Leu Gly Trp Pro Leu Tyr Leu Ala Pan 
^ 9 0 95 



Asn Val 



.y Are Pi- 



Arg Phe Ala Cys His Tyr Asp rri; 

105 1 1 j 



Tyr' Gly Pro I . ~ Tyr Asn Asp Arg Giu Arg lie Glu 



PO-- T 1 , 



Asp- A^a G.ly Val. Leu A., a Val 



;iy Leu Tyr Gin Le- Ala He 



Gl a Trp Val Val Cys Val Tyr Gly /V Pro Leu Let 



r r -j ri r s 



/i Asp Ser Ser Vu Trp Asp Trp Leu Arg Gly 



1 9 5 
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INFORMATION FOR SEC. 1 ID NO : 1 1 : 



(i) SE'jOENCE CHARACTER I ST I C3 : 
(Aj LENGTH : 13*': 9 base pairs 
{ EO TYPE: nuclvi : acid 
(CO ST PAN FEFNFS F : ioubie 
( D TOPOIF" ;Y : I ir ; e ar 

ill) MO-LEC'ILE TYPE : cFNA 



hypothetica: 
ant i -sense: 



NO 



(A . 



-fig: nal s<: 
organ: SM: f 



.1 ? l r.ui 



(viii IMMEDIATE COLLEGE: 
(Bo CF'ONE: pREF-7o-4F 



(AO 
C B ■. 



FEAT C PE 
NAME/ KEY 
FjCATjV'I 



<T>0 



OFNOE DESCRIPTION: SEO ID NO : 1 1 : 



AACACCACAC 0 FFAIAAOTT AGTC FT AAG A GAGAGAGAGA GAGGAGACAT TTCTCTTOTC 120 
TGAGATGAGC A 7TFCPCC TC CAGAI'AFCGA AGCCTCAGGA AAGTGCTTGA GAAGAGGTTG 180 



AGA ATG GGA 
Met GO/ 



GO GGT CG2 ATG TOT ACT GTC ATA ATG AGO AAG AAC 

j.y Gly Ara Men Ser Tr.r Va.l lie lie Ser Asn Asn 
- i - i - 



AOT GAG AAG A.- -A. OGA aoA AG 7 AGO GAG GTG GAG GGA GCG CCG CA7 AGG 
Ser Glu Eys Fys F_ y Gly Ser Ser His Tea GLu Arg Ala Fro His Tnr 



27 6 



aag cct got t; 

Lvs Pro F ro I 



lr\v Le 1 



;t aag c 1 



AAG AGA GCG ATG 



CA'I 



Ara Ala lie Pro Fr:> His 



TGC TTT GAA 



ICT TT' 



C TCA TTG TGC AAT TTT 



FAT AAT 



'yr FJe Ser Ser F'ro Fea 



'•" r Tyr Val Ala Trp Leu Val Tyr Trp Leu 

00 95 
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Phe Gin Gly Cys lie Leu Thr Gly Leu Trp Val He Gly His Glu Cys 



1 0 0 



105 



110 



ggg gat gat ::;ct 

Gl.y His His Ala 



TTT AGT GAG TAT GAG GTG GGT OAT 0A0: ATT OTT GGG 
Phe Ser Glu Tyr Gin Leu Ala Asp Asp He Val Gly 



HO 



12 5 



OTA ATT GTG -TAT 
Leu lie Val His 

13:- 



tgt gca gtt ctg 5 m T gga tat ttt tga tgg AAA tat 

Ser Ala L^u Leu Va L Pro Tyr Phe Ser Trp Lys Tyr 



14 0 



AGl: GAT GGG GGG 
Ser His Arg Arg 
1 1 5 



GAG GAT TCT AAC ATA GGA TGT GTG GAG GGA GAG GAA 
His His Ger Asn Lie Gly Ser Leu Glu Arg Asp Glu 



GTG TTG GTG GGG 
Val. Phe Val Pro 



.AAA TGA 



ber L//3 
165 



tgg aaa att tca tgg tat tgt aag tag 

Ser- Lys lie Ser Trp Tyr Ser Lys Tyr 



17 J 



17 5 



:ta aag aag GGG 

,eu Asn Asn Pro 



CCA GGT CGA GTT TTG ACA GTT GGT GCC AGG CTC GTG 
Pro Gly Arq Val L-u Thr Leu Ala Ala Thr Leu Leu 

HO IS z 190 



cr r r 



TGG GOT 



Leu Gly Trp Pro 
1 ^5 



TTA TAT TTA GGT TTG AAT GTC TCT GGT AGA GGT TAG 
Leu Tyr L-u Ali Pne Asn Val Ser Gly Arg Pro Tyr 

7-0 20 = 



2 1' 



2 1.5 



aga 3AA ago; :tt 



Arg j-i- 



Aru Leu 



:g TTT GT< 
.r Pne V a . 



T T T 1 



TT 
L e u 



:ag ATT 'I 
Gin lie T 



TAT GAG 
Tyr Glr. 
2 4.: 



ATT GGT GAG GTG GGA A TO" TTT GCC ACj 
He Ai a Asp Leu Gly lie Phe Ala Tn.; 
? 1 n 



OA ATG 
hr :o.-t 



AAA GGG TTG- GGT TGG GTA 
y s 3 1 y ^ e u Ala T r p j a i. 

2 5 0) 



ATG 



ATC T, 

[7: T' 



GGG GTG 
2 6 0 



CH ATT GTT AAG TGT TTG 
^.u lie 00a 1 Asn Gyi Pne 
7 6 c 



00a i 



ATG ATC A 1 " 
Met : I e H 



7 5 



, ■ - i - : i ■ 



:i i. S 

2 0 0 



■jr.... l .-a i 

r g ' F y r 



'OAT TAT GGO 

Asp Tyr Gly 



V a 1 Phe His Asn He Ala Asp Fhr Hi.: 



GTA GCT 'OAT GAT CTC TTT GCT ACA GTG CGA CAT TAG CAT GCA ATG GA : 



Val Ala Hi. 



U i O T 



,eu Phe 



rhr 



Pro His Tyr His Aia Mer Glu 



GCC ACT AAA GCA ATC AAG 3 3T ATA ATG G 3T GAG TAT TAG CGG TAT GAT 12:- 6 

Ala Tnr Lys A I. a lie Lys Pro lie Me: GLy :iu Tyr Tyr Arg Tyr Asp 
34 0 34 5 3:0 

GGT AGC CCA TTT TAG AAG OCA TTG TGG AGG GAG GCA AAG GAG TGC TTG 126 4 

Gly Tnr Pro ?ne Tyr Lys A L a Leu Trp Arg Gla A L a Lys Glu Cys Lea 

;35 3 00 ioT 

TTG GTC GAG GGA GAT GAA G 3A GGT GGT AG A CAA GGG GTT TTG TGG TAG 133 2 

Phe Val Glu Pro Asp Glu GLy Ala Pro Tnr Gin GLy Val Phe Trp Tyr 
3^:> 37 5 380 

CGG AAG AAG TAT TAAAAAAGTG TCATGTAGCC TGCCG 136 9 

Arg Asn Ly ; - Tyr 
3 0 5 

(2) INFORMATION FOR SEO I D NO : 1 2 : 

■. L; SEQUENCE CHARACTER I ST ICS : 

(A) LENGTH: : ~ : 7 amino acid.i 

( B ) T'fPE: amino acid 
(D) TOPOLOGY: linear 

(li: MOLECULE TYPE: protein 

{>:i SEQUENCE DESCRIPTION: SEC) ID NO : 12: 



Cis* Lys Lys G 1 y G... y Ser Ser His Leu G La Arg Ala Pro His Thr Lys 

Pro Pro Tyr Thr Los GLy Asn Lea Lys Arg Ala He Pro Pre- His Cys 

.3 0 4 ■:■ 4 5 

?he Glu Arg Set Pne Val. Arg Ser Phe Ser Asn Phe Ala Tyr Asn Phe 

3 0 3 5 6 0 

Cys Leu 3 or Phe Leu Ser lyr Ser- He Ala Thr Asn Phe Pne Pro Tyr 

n " " ? ; ' 7 5 8 3 

lie Ser Ser Pr: Lou Ser I /r Val Ala Trp Leu Va L Tyr Trp Leu Phe 

Glu Gl y Cys I i o Leu Thr GLy Leu Trp Val lie Gly His GLu Cys Gly 



t ] ^ y a } His S e r A 1 a Tea L 



l 4 n 



- s A r g Arg His His Ser' Asn lie Gly Ser Le u Glu Arg As p G 1 u Va 1 
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Asn Asn Pro Pro Gly Arg Yal Leu Thr Leu Ala Ala Thr Leu Leu Leu 
180 18 5 HO 

Gly Trp Pro Leu Tyr Leu Ala Phe Asn Val 3er Gly Arg Pro Tyr Asp 

19 c 20 ■■ 20 5 

Arg Phe Ala Cys His Tyr Asp Pro Tyr Gly Pro He Phe Ser Glu Ara 
2 1 0 2 1 5 2 2 0 

Glu Arg Leu Glr: lie Tyr He Ah:,. Asp Leu Gly He P'he Ala Thr Thr 

22 5 2 30 23 5 24 0 



Phe Val Lt;u Tyr Gin Ala Thr Met. Ala Lys Gly L^u Ala Trp. Val Met 
2 4 5 2 5 0 2 5 5 

Arg He Tyr Gly Yal Fro Leu Leu He Val Asn Cyo Phe Leu Val Met 
2 60 26: 270 

He Thr Tyr Leu Gin His Thr His Pro Ala He Pro Arq Tyr- Gly Ser 

2 " c 2 H 2 8 5 

Ser Glu Trp Asp Trp Leu Arg Gly Ala Met Val Thr Yal Asp> Arq Asp 
190 2 9 5 3 1 1 0 

Tyr Gly VH Leu Asn Lys Va 1 Phe His Asn He A', a Asp Thr His Yal 

305 OH 315 ' 32 0 

.12 5 3 3 0 3 3 E 

Thr Lys A.l a He Lys Pro He Met Gly Glu Tyr Tyr Arq Tyr Asp Gly 
314 0 34 5 3 50 

Thr Pro Phe Tyr Lys Ala Leu Trp- Arg Glu Ala Lys Glu Cys Leu Phe 

3 ; 3 6'I "'3 63 

Val Glu Pre Asp Glu Gly Ala Pre Thr Gin Gly Val Phe Trp Tyr Arg 

570 3 75 3b0 

Asn Ly s Tyr 

1 H c , 



INFORMATION PUR cEQ 13 NO: 1 3 : 



(iv) ANT I -SENSE : NO 
A) NAME /KEY : use 



11 



(B) LOCATI- )N: 1 . . 23 

;D) 3THER INFORMATION: /product^ 
"s yntri-ut i 
o2. Lgona z 1- :»*■ i " 

.xi) geo'/enoe peggri ption : seq id no:13: 

7GGGTATGOG A VGA! .G'G Y 5G tI<::A 



23 



(2) 



information for deo id no:14 



(i) ^£';n:i!I'^ CHARACTERISTICS: 
i A i DENG- T r. : 12 ha s ~ pairs 
; h ) 7 V P E : ; i u ■:• I e i r. ^cia 
vG ' G TFANPEDNEGS : Gr.qle 
G>j 70: ODOGY : iirvr^r 

■ i i i i-i' -decide : ype : -dma 

■ : 1 i i HVEGTH-lTI TAD: NO 

..lv) ant: -sense: no 

' ix i FEA7GRE : 
t A 1 1AM F. .'KEY: m l 5 r i e a t u r e 
B ■ LO' V.T I < '! I : 2 . . : .2 

■ p ■ < thee : n format :: • n : ,■ produ- 



.J 1 Hi L-'t, 



EECk I FT T i"'N : SEO TD N 



o i_j v ->- lI l^^' ; j. 4 : 



AAARTGRTGG G 



7G 



(2) 



IK FORMAT ION FOR GEO ID KG: 15: 



{.i GEo :ENNR CHArACTEP I ST I CS : 
LENGTH: 2 97 3 L ase pairs 
7 Y - x -~ : clf-ic 3 c i cl 



double 



i HYPOTHETICAL: NC 



.. i )NE : pAGE'2 - 



(ix) "EA7UKE : 
: A) NAME, KEY : exon 
;B; ROGATION: 433. . 



(ix) FEATURE 
(A) NAME / KEY 
(3) LOCATION 



intr^n 
521 . . 1654 



{ xi ) SEQUENCE DECOR I PT I ON : 
ATTCGGTAAT TOG lACAl' AT TTFAjAjATT 
GATTATAAAT TTAAAATACG TACTTTTCGA 
TGATATTAAA AAGTTAATGT TOACT3TTAT 
TA^AGTAGOAT AAAAATAOCA GATATTTTGA 
GG AGAAAGAA AAGAAGC : TGG GGGOCCAGTA 
TCTOTCTCTC TACCTTCGAC CCAC AGGCCG 
CCCOATCTGA GCAACACG\AG AAGAGOOACA 
AGAGA jACAG AGA AAGAGAG AGATTOTGCG 
GTTATTAACG TTATCGCCCC TACGTCAGCT 



SEQ ID NO: 15: 

AGTTT SAGTT TCCATC :ATA CTTTA:TAGT 6^ 

ctataaagtg aaactaagta aattagaacg 1.10 

atttttt toa caa ataaaaa atg agttat v 1 -o 

a 3ttgaaa taa 3 ag ag g a .co!) 

atgtcatc aaatcatot: ?-)0 

aCOCrGTC TCTTGCCAT T • *>:: 

AAAAA j AG A.3 A 3 AG AG AG 0 j 

:ttcgtag gat^ttcatc 40) 



attgattaaa aa- 
tgaaagggaa ag. 
tgtooattta .aa' 
cactcacaaa tt 
gaggagcttc tt 

OO^TC'tVOA " ^ p 



TTCTTCTCAT TTTCGATTTT GATTCTTATT TCTTTCOAGT AG 



T ~'T ZTTCCAT 



:tcctgot ctgtgaattt 



i i ALrt i i 1 .A. A i 1 1 b'j I/O I'O'jA'I'T 



^TTAATAAT GATSAACCCT C FCATTCATA CAATGATTAA TT'R; 



:gt otacc; 



ATATGTTGCA TTTTCACCTT TCTTOTTTTT TTCTAAGAT 3 ATTTGCTTTG ACCAATTTGT 
1TAGATCTTT ATTTTATOTT AT T TTCTGGT GGGTTGGTG j AAA 1 TGAAAA AAAAAAAAAA 
AlAAA G ■ J A T AA ATT ATTATTT GTTAATGTAT TCATTTTTTG GCTATTTGTT OTGGGTAAAA 

ATTGGGTT TTTATAGTAA 



ATOTGCTTOT ACTGTTGAAT i'TTTOi^TGGA TTTTTTAOTC 
.AAA T A C A T AA TAAAAGGAAA ACAAAAGTTT TATACATTC T 



'P'T i TG<- A A AATTTTOTA 1 ^ -VR^'^i'TOGT^ A'*" ATTTTAT T TT^Ti"] 



ji A L TAOA 1 . 



\ A AT ATTT - A Zj. T 



^ u 



-TOT AC 



" , /\TT[" , '!""T' rnr T 1 T 



TGATTTOOO AC I'V'i'GbAT 1 ' 



CACAAGCAAG AGATGTGACC TGCCTTATTA AOATCGTATT 1 3 !- 0 



■ 1 /A 1 



"TTTCTATTT GTGGC.AATOC i 
"G'i'OT GA AGACTCTCTT G.AATCGTTAO OAOTTC7 
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GOTTGTTGAT TGTTTTGOTT TTGGTTTT 27 
G Z 3TT0 TTAO T TGTTGG.AA Z AAA TO 3 3 AAA 
AA :GGG 3TTT OTOGGTGGGA GATGTGAAGA 



:ttatga 
:tootgo 

3T00TAA 

:aa.tggo 



oaatog:tog ztctttctc: tag 
aogtogggag oaattaotto tgt 
TOTATTGGGG gtgtgaaggg tgt 
GTOAOOAOGG ATTCAGGGA: TAG 

ogttootoot ggtcggt fag ttgtgotg za 
otg aat :ggt :gaaagaaaf gaa3tatttg 
agg 3gaaata ootoaaoaao ootottggao 
t0gggtg30: ottgtaotta g 30tttaa0g 
g:gatttgtt ggggaagggt g:oatotaga 
gtgatgg:;gg tattgta^oo gtotgttttg 



g :a:;aaaoat 
g :gaoaooa: 
aagoaatggg 
gtga:atgat 
gtgaggotot 
otggtatotg 
tgga tgagag 
agtatagtga 
tgg:aaagga 
3:atgatgat 

T OrOTGGOAG 
ATGAGGGAGA 
3T0TTTA00G 



GGGT 30AGGT 
AAAG Z jTGTG 

gg :goattgt 
tatagggtga 
gtgttaottg 
ggtoatagoo 
agt tggtgtt 
toggogtoa z 
aaaatgag za 
gttaaoogt: 
agg 3tatga0 
aggootogag 

TTAOGOTG Z7 

A ; i '[' AAA I j . r " ^ 



GGAAG.AATG T 
00 3TG0 3AGA 
TTOAAAGGOT 
TGOTTOTAOT 
GGTTGGGGAG 
GAOGAATGOG 
ATGTTGGATT 
GATTGGAAGA 
ATGAAGTGGT 
GAGTTTGTGG 
GGGTTGGGTT 
A TATAOGTGT 
GGAGAAGGGA 
TTGGTG 3TGT 



168 j 
I'M) 

1 8 0 0 
IS 60 

2' A 0 



. 4:lG 



tgatca:tta :ttgoa3oao aotoa r zgzt : gftgootoa :tao aatt za toagagtggg 



a:t3GG]::a3 g^gagottt^ g^taoo^fag acasasacta og^aafotfg aagaaggtgt 
t00a0aa0at tagaga zaca gaogtgggto at:a0:tgtt gtoaaaaatg ggggattata 



a :goaa'f z-za agota aaaa z 
gaaoao : zt z afatgtggo z a tgt at ag zz 

gaa gaaatt z t:gagtttt: TGTTGTGFG1 



GAATT OFGG Z AGAOTATTAO GAGTTG 3ATG 

AZZ AAAAGGA GTGTATGTAT GTAGAAGGGG 

.j Z ZA JAAGAA ihA j i A— i . '.A^A F'jAT'j'jT 

i jAAlj" TATGGTTGGT 



TTuT ^ T T T T 



TTTTGTTGTG T TAT GAG ATT TTGGGTGGTJ ATTATGTTA1 



: 6 4 ■ 



zzsz 

:^4 0 



\ ) ^L..Vji;i: J DdSt: pdi CS 

G 3 F RAN D E DM E S G : single 
Til MO LEG ULE TYPE: z ON A 
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fiv) anti-senge: n:» 

(. ix) FEATURE : 
(A) NAME ' KEY : raise feature 
f B) LOGA7I> 'I J: 1 . . 2 

(D) OTHER T !J FORMAT I 7N : 'product^ 

"? yr.th-- i . 

C).l iqcm.: " 

(xi) GEIHENOE E'ESOEI PT TO: I: 3EQ ID NO: 16: 

G7GCATGTNG ARAA!iAi-RT^ RTG 2 3 

(2) INFO F.MAG 1 N RO- SEC? ID NG>:17: 

{ i } GEO' 'ENOE CHAF'AGTEF.I JTIC3 : 
(A : , EN 3T H : 1 2 ba s •:- pi a i r s 
(R TYRE : :.u =c.ic 
(C 1 S7FANDEDNESS : single 
(EE ^OPO:.!"::': .linear 

( i i ) HgEEG !, OE TYRE : cG-NA 

{ l i ) HY r ■ ''I RET I GAL : NO 



(A I J AT IE KEY : mi s c rea:ure 

(R l< >ga: i( :;: 1 . . i\ 

( D ■ OTHER : * J FORMAT I ON : product^ 

oilmen; '7 1 t :. do " 

;xi; EEI'G.NOE EMERGE I P7I0N: 3EQ ID N ( J : 1 7 : 

GGGGATGTRG I 1 ;-2^GN/-2G*.TG F.70 23 

[2] XNFGF.MA7R' N FOE 3EI ID NC : 18 : 

( i ) EG07EM0E OHAFA07E? ISTIGS : 
(A.- I2-NG7H: *: amirij acids 
(& 7YRE: ,-iniir.C' a r< i d 



I N r 'jF.[ 1A7 I ■' 'N EOF. GEO IE NO: 19: 

G- ) GE07ENGE CHAFAOTEEI ST 1C 
(A; LEN3TH: £ am in:; acids 
( B ) TORE: im i no a c i d 
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(ii) MOLECULE TYPE: protein 



SEQUENCE DESCRI PTION : SEC ID NO : 1 9 
Ala lie Pro Lys His Cys 

(2) INFORMATION LOR SEO ID NO: 20: 

(i) O^EOUENOE CHARACTERISTICS: 

(A) LENGTH: 6 rami no acids 

(B) TYPE: imir.c aeic 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Li x ) FEATURE 



NAME / KEY 



un.-ur e 

f.B) LO TAT I ON: 
tD) OTHER INFORMATION: Xaa = Pro or Lys 

(xi) SEQUENCE DESCRI PTION : SEC ID NO: 20 

Ala lie Pro Xia His Cys 
1 

CJ.) INFORMATION EOF. LEO ID NO: 21: 



amm:- at; ids 



(A) LENGTH: 

( F ) YY EE : .-.mi r.o ac i a 

f D ) Y l . Y'OLOG Y : 1 ine ar 

fii) MOLECULE TYPE: priOein 



I ixj FEAT UE E 
f A) NAME/ KEY 
i P ) L< ■■ YAI I ON 



a a = Leu o r lie 



( D ) OTHER INFORMATION: 

ixii YEOUENCE DESCRIPTION: SEF ID NO : 2 1 
IrD Pro Xaa Tor I rp 



INFORMATION EOF YEQ ID NO : 2 2 : 



T r p Fro Leu T y r T r p 
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INFORMATION FOR SEO I D NO: 23: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: o amino acids 

(B) TYPE: and no acia 
(D) TOPOEoGY : linear 

'.'11] MOLECULE TYPE: piotem 



{ B; 



FEAT'. 1 RE 
NAME /REV 
LOCATION 



■;ns u re 
3 



Trp 
1 



f D; OTHER I N FORMAT ION : 
i . Xl ) SEQUENCE DESCRI ET: 
Pro X.Ba Cyr Coo 



aa = Leu or lie 
: SEQ ID NO: 23 



INFORMATION FOR SEO IE- NO: 24: 

i.L i SEQUENCE CHAFACTERISTICS : 

i'A: LENGTH: E am in:- aciGo 

I B ; T Y E 1 E : ami no a c i a 

[ D ) ?Or'OL ; ) .";Y : li ne ar 



i ■ .: ) Mi i EE- ,; ' 1 



A ...a His G.Lu 
1 



Hi : 



information fof. seo ir nor; 5: 

■ o i SEQUENCE CHAFACTEF.I STICS : 

(A: LEN7.TH: G ami acid::. 

; L- ; TYEI: amino aoio 

I R : TOEM - >GY : linear 



:CE DESCRIPTION: SEQ ID NO:25 



o 1 < * u ; c- a - r 



ami.no acia 



ix; FEATURE 
C OTHER IN 
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■ix) FEATURE: 

(A) NAME/KEY: unsure 

•;B; LOCATION: 3 

-I') OTHER INFORMATION: Ma a = Asp or Glu 



:RIPTTO]j: SEQ 1U NO : 2 6 : 



Xaa His Xaa Oys O-y his 
1 

(2) INFORMATION FOR SEQ I P NO: 27: 

r) .i'EOVENOE CHARACTERISTICS: 

f I\) LENGTH: :■ amir.':' acids 

i R ) TYPE : am i no ac i a 

i L- TOPON- 1 ir-e ,r 

i:r) MCCEOf'LE TYPE: pr:- 4 ;ein 

■;-:::_) SEQYENCE PESO?. I ? T ION : SEQ ID NO: 27: 

Leu Leu V al Pro Ty r 
1 

(2) I N F0>RMAT ION FOR SEQ IP NO: 28: 



P TV: 

P TOO 
. ^ i ) lie: 



im i r.c ic i a 
l' : - i r.ea r 



" : LE TYPE: pr e i n 
">:i) SEQO'ENCE PESO'R I E T I ON : SEQ ID NO : 1 

He Leu ■'a I : : : Ty r 



INFORMATION FO? 



NO : 29: 



, ) .:■ E,QN FN< "E CHARACTER I FT ICS : 

; A LFNOTH: : amir.': acids 

, p. TYPF: ".rr.ir.c n r i a 

' [J ' TL'PO-L: 'VP lineal 



. 2. cTHER 1 NEcRPIA II cN : Xaa = Leu or lie 

;x.) SEQUENCE DESCRIPTION: SEQ ID N0':29: 
Xaa L e u V" a I P r o T y r 
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(2) INFORMATION FOR SEG ID NO: 30: 



Tr P 
1 



;i) SEO n ENCE CHARACTERISTICS: 

(A) LENGTH: tf amino acids 

[B; TYPF: a^ino icia 

f D ' OTOLOGY: li;:e,r 

ni) MOCEC'IEE TYPE;: pi: :ein 

i ."EOVENCE DESCRIPTION: SEQ ID NO: 30 

Lys Tyx S^r H i .5 Ar>i 



(2) INFO F.MAT I ON EOF. SEO ID NO:31: 

) CEvVENDE CHARACTER I STICS : 

A. LENGTH: \: amino acids 
i B 1'VFE: .imir.o jcia 

i R T". < r- ODO 3Y : O. r.e^r 



Trr 



(ii) KOLECNLE TYPE: protein 

CEOOENCE DESCRIPTION; SEQ ID NO: 31 
Arq lie ."*--r Mis Ar i 



OEvOHN 1 :E CHARACTER I 3 TICS : 
LENGTH: <:■ .umirio aoi ds 
T HE.: -.or. ; ;o.> ac i a 
?! T OLOGY : 1 irie 5 r 



17 'T-vpi; 



1<] r ^ i . 
AO NAME/?;EY: 
! h L' ".AT 7 ■■ 'I I : 



Xaa = A: 



O 1 ■ (J ; 



'JNFCRI-IATIC'N: Xaa = He or Tvr 



OO Lr.NGTH: 6 amine acids 
NO IV ; F : ammo acid 
O-O ToR:Ao3Y: lined; 



j EC 10 iE 'I Y F- E : i >rof e in 
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fxi) se:>uen:e DESCRIPTION: 



SEO ID NO: 3 3 : 



Ser His Arg Arq His His 
1 



..i) seo^ence CHAFACTERISTICS : 

■ 70 LENOTH: t dmir.o acids 

i P.) TYPE: ..minij acid 

ip) TO: 'OLOHPi' : linear 

- : i ) Mi > P E C.J : , E TYPE: pr:-'.ein 

f>:i) SEOEENCE PESCR I PT I OH : SEO IP NO:34: 

Ser His Arg Tnr His His 

1 

(2) INFORMATION FOR SEO ID MO: 35: 

■, r) SEOEENCE CHAFYCTEFd STICS : 

(70 PENGTH: 6 .amine acids 

ib) TYPE: and ne acid 

i L 1 ) TuPOPOSY : .1 ire sr 



( : x ) FEAT PRE : 

(A) NAME CRY: ;.ns:re 

fp) P.i^'ATjVJ: 4 

CO OTHER INFORMATION: Xaa --= Arg or Thr 
ixij SEOEENCE PESO? IPTION: SEO IE NO:35: 
Ser His Ara His H i s 



INFORMATION EOF. S EC' ID NO: 36: 

i) REOPEN- 'E CHAF'ACTEF. I ST ICS : 
;A) LENGTH: 5 amirs acids 
CP TYPE: • r ; r~ ■- b i d 



INEOF.M/C I (IN RPR SEC IP 



; i ) . '» p: o o e:j-:: e c h a r ac t e r i s t i c s : 

;A; LEN3TH: 3 amine acids 
;B) TYPE: iiTiino acid 
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di) MOLECULE TYPE: pntein 



Val 
1 



(xi) FEF'DENCE DESCRIPTION: SEQ ID NO: 37 
Thr Tyr L- j His 

INFoTTlAT ION FOR FEO ID No: 38: 



' l ' 
(A) 
( 1- J ) 

■: v) 



1 X ! 



FFFYENOE CHAFACTERI FTICS : 
jENOTH : 5 .imin:< aci-ls 
n YPE : irr.i n "> =i c i ^ 
1 irie a r 



F.'POLOOY 
M'"'LECMLE TYPE: 



pr : :! "..ein 



FEATURE : 
>Ai NAME /FEY: „n -.a r>~ 
' D ) Li.»."AT I ON : I 
T } OTHEF INFOFOIAFION : 



la a = lie or Val 



FEAT UP. E 

:;ai-:e/ key 

Li n *AT 1 ON 



■ .no a r 



■m other tnfoffiati 



Faa = Gin or His 
SCr.I PTI-" N : SEQ ID NO : 3 8 



INFORMATION EOF FEv ID No: 39: 



'.A. 
P . 

■ P 



Leu 
1 



s e oof::: o e c i i afa - : t e r i s t i c s : 

LiENOTH : A ,iuiinj aoiols 
7 Y F E : ami ro • a c La 
F 'P: LOGY : ir.e 5 r 

■ :.l Mo PEOPLE I ":' PF : protein 

xi SENTENCE DESORI FT ION : S5 

>ro H i s 'I 1 -/ r 

INFO-FFJAT ION EOF. FEO I D IIO:4 0: 
: i ■ FFF-FFNFE C}{ AFAF^F R T ^TI^S : 

i - i 1.1i'iT.rO'T T P pit . r ^ ^ ^ 



F FOUFN'^F 



J_i e U E r C 
1 



(.?.) INFORMATION FOR SEO ID NO : 4 1 : 



;i) 3EQFENCE CHARACTERISTICS: 

(70 LENGTH: 4 amino acids 

(H) TYPE: amino icid 

O ) ) T : "> PO I. ; > "J Y : ^ i r.e ^ r 

iii) II 1 >LE I FEE TYPE: pr-rem 

<i:<) FEATURE: 

(Ai NAME/KEY: insure 

t R ; L< iCATION : 

Oo OTHER INFOF01ATION: Xaa = His or Trp 

ixi] FEO'-'ENCE DESCRIPTION: SEQ ID NO : 4 1 : 

Leu Pro Xaa Tyr 
i 

(2) INFORMATION FOR FE-y ID No: 42: 

■■i) FE0VEN0E CHAPACTERIFTICS: 

(A, LENGTH: t .-imi n acids 

( b ; TYPE : um.i n*.- ic. i d 

(Fi T-'iRO'L/'GY : linear 



i : .l ; N—DEC'OE IYPF: pr::-*ein 

(A NAME ' FRY : :n.Eure 

:P- 1/ ".'AT :ON ; 

(L-- OTHER IN FOP MAI I ON : Yaa - Arg or Lys 
(>::u FEyFENFE DESCRIPTION: SEQ ID NO : 4 2 : 

Trp Leu Xaa /. . y AO-. I.e.. 

1 

(2) INF(;?FIATICN EOF NEC II 1 NO : 4 3 : 

i ■ . 'F0YEN< "£ CHARACTERISTICS : 

(A) LFNITH: F . :iriin>: acids 

(b) TYF-E: .'-rru n 1 ..' 3 c i u 

y [ ; T l »! ;GV : : i r^ 5 r 

( -.i ■ IF'FFO 1 1 ■ jE TYPE: pro*: (5 in 

(2) I N FORMAT I ON EOF ?F0 T F : No : 4 4 : 

i : -yon^ojo^ ^HAFAX'TERI FT I CS : 

■;Aj LENGTH: 6 amine- acids 

( B } "O'PF: ami no acid 

. Di Ti.jPOLOGY : linear 
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(ix) FEATURE 

(A) NAME /KEY 

(B) LOCATL >N 



unsur- 

1 



( D) OTHER INFORMATION: Ma a = Trp or Tyr 

; i x) FEATURE : 

i A [ I I AME / EE Y : unsur- 

(p; LOCATION; 3 

( F. 1 } OT HER I N FORMAT I ON : Xaa = Arg or Lys 



fix) FEATURE 

( a; name /key 

c P ) LOCATL )!I 

fP; i .'THE P. INFORMATIS 



Xaa = Ala or Gly 
ixi) SEOUENOE DES :RI PTION : SEO ID NO:44 

Xaa Leu Xaa Coy Xa-a Leu 

1 

(2) INFORMATION FOR SEQ 10 NO : 4 5 : 

U) SEQUENCE CHAF.ACCERISTICS : 
(A) LENGTH: 7 ammo acids 
(R: TVl-'E: -mi no so id 



(x- ) S LvYHNCP DESCRIPTION: SEC- ID NO: 45 
Thr Val Asp A: ? Aop Tyr C y 

INFOFLMAT IC-:: EOF. SEO 1 I L NO: 46: 

fr) °- - !r"E CHARACTERISTICS: 
(A: LENGTH: " amir.L acids 
:P; TY:E: ammo ac:a 

fir) IE 1 'CLE TYPE: pi .. \e ; :i 

C;C SF'SCENOE DESCRIPTION: SEO; ID NO : 4 6 



(A) LFN'-;TH: 1 amino a-ids 

'. P .. i : b : , -.mine acid 

: D 1 TOP OL< >GY : 1 m^a - 

OS MOLECULE TYPE: pre tern 
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(B) L iCATD!]: 2 

( P ) OTHER INFORMATION: Xaa = Leu or Val 
AEQMENOE PXSORIPTION: SEQ ID NO : 4 7 
Thr Xaa Asp Ara Asp r Iyr II] y 

(2) 1H FORMATION EOF' SE'J ID NO : 4 8 : 

■ i } i-'SOVenoe charaotep.i jti rs : 

■.A) Lj hi ! 7 ■ I- ' I H : ■:■ amir.o acids 
(bi TYPE: -imino 3 rid 
( Fjj ToPODoOY: linear 

\L'i) iLE'" : ." i jE TYPE: p: ■: "ein 

■>:i) ."e;>ve:;ol desori pt:-jN: seq id no : 4 8 

Thr His Vsl A . e. Hi.-: His leu Phe 
I 

(D) IKFOFI1A70ION FO~. PEl* ID ].*:-: 49: 

-.i) te^te:;te CHARACTERISTICS: 
i A ' 0 E ! J AT H : 5 a m i r. ~ a < "• i i i s 



Thr His Val 
1 



10" HE. TYPE: pi :vein 

:L DESCF.I FT:-!!!: SEQ ID NO: 4 9 
H- s His Deu Phe 



IKFijF.I1/-.7 I ON FC? SE'j ID NO: 50: 



;haractef i:;tics : 



O NO'I H : 8 amir.o acids 
r ' O 'jTi 'OY : linear 



x) HEATYF 



INFORMAT I ON FOR SEO IP !IO:51: 

'. i } S ECU EI JOE O t j A AAOT ERI ST I 



3S 



B) 



TYPE : amino aci i 
T : 'POLOGY: l inear 



His His Leu 
1 



MOLECNNE TYPE: protein 

SETUP: joe; DESCRIPTION: SEO ID N0:S1 

r S-r Trcr Met Pr: His Tyr 

1 0 



(2) IN FORMAT ION FOR SEC II 1 ri'I- : 52 : 

ii) SEOYFIJOE CHARACTERISTICS: 
[ A ) 1 , ! N G T H : i 2 a m in-: a :• ids 
\h) TYPE: amino ac id 



LEO" I IE TYPE: pr-: t.ein 

C'YENOE PESCRI PTION : SEQ ID NO: 52 



■ >:i ) 



His His Leu Pne Pro Gin lie Pre His Tyr 

1 : 10 

(2) INFORMATION FOR NEC IP NO: 53: 

■ i r t ( "* tt 1 H A YY' T F P I S T I ^ S " 



I 1 ; 



>ii) M'.uLEO ; 'L^ 1 VPl: protein 

i i x) FEATYPF : 

{ A ) NAME t KF Y : unsure 

Y) l—;at:-:i: 

M 1 } OTHER. INFORMATION: Xaa = Pro or Ser 



i : x ; :- oA i I r : 

i/O NAME ' FF Y : unsure 

■ P ; L : ;: YAI lull: e 

MY OTHER INFORMATION: Xa; 



^ A; NAME FIY: 
(b) L 1 'CAT PC 



I N FORMA I I ON EOF. SEC I D NO : 5 4 : 



NEYYENCE CHARACTERISTICS : 
L E N 5 T H : i 2 base pairs 
TYPE: nucleic i • i i 
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(ii) MOLECULE TYPE: cDNA 
(in ) HVF' ''TMETICAL: NO 
( iv ) /v! IT 7 -SENSE : NS : 



misc texture 



jFMATION: 



produ zt ~- 



(ix) FEATURE 
(A) NAME ' KEY 

■■p. ■ l* nation 

' E' ' 07HEF. MI 
ol igonue.l -"-or. ide " 



(xi) SEQ' 'ENCE DESCRIPTION: SEQ I D NO:54 
ATAGCCCCCC AA 



12 



\2) 



INF0P.MA7Ii.-N F< >F. SEQ ID NO: 55: 

(i) SEQUENCE CHAFAvCTEP.ISTICS : 
(A; LENGTH: 12 base pa i rs 
;B) TYPE: nucleic acid 
( C STi-ANE'EDNESS : smcjle 
: [ ' ■ T^POFO^Y: linear- 



Mi 0 



A J A I Ot=T.07Tll C ^ 



iv! ANT : - SENSE : NO 



SEi, 



NO : 55 : 



v2; INFORMATION F< >F. SE.O ID NO: 56: 



12 



e r " e : : i :: :-: c ; { arao t e pi s t i c s : 

LENGT H : I 5 base pe* i rs 
r E Y ! F : rue lei'": arid 
SI FANEONNESS : .= inc; I e 
T'" : : C NO r : Y : 1 ine 3 r 



( J ! OTFEF. INFORMATION: 
" s vr. h ^ * i ^ 
oligonu- . 1 e _it: ide " 

( x i ) SEQUENCE DE?( 



3 EC 
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(2) INFORMATION FOP. SEQ ID NO: 57 



( L ) GEO ! . ; ENCE CHAFAGTERI STICS : 
i A) LENGTH: 1 J base p:nrs 
i B ■ T V f E : n o clei: ici ■: : 
; c; • STFAN: EDNEGG : =.in;ile 
■ [> ;■ TOFOL' 0;V : i i r.e.*r 



F!< -LE' 'OLE TYPE 



c L'NA 



i i) HYPOTHETICAL: NO 



iv) A: IT I 



no 



■ i x ) t E.m. i r'.^ : 
'A' NAME EEY : ni^c I:e:i:ure 

■;b> l-xnation: I. . i-j 

ij O T H EE IN EC ' P.MAT I ■ J N : /product^ 
ol i ac rv.i: 1 e : * .:oie " 



:xi) OEOCE-CE DESCRIPTION: SEQ ID NO: 57; 

GTCGACGAGG 



10 



N O; - : 11 h : , : ro 

1 I* 1 V ■ E ; 1 1 1 'i'. I b i i" E: c:m : i 
'CO SI PAN.' E; )NEG3 : ioiroue 
■ [" Cv T I'D' -O-Y : iir.e :. r 

.Li) MOLE'-'ILE TYPE: cDNA 

v -Li) H YE' OHET I OAL : MO 

iv) ANT I - CENSE. : N j 



:A ■ NAME 



" i on 



ini5C :0-. :i ; or e 
\MAICjN: / product - 



(2) INEORMATI-jN EOF. SEQ I D NO: 59: 
1 < .. K-J'J :* N< " E G :1 A FA I T E F. I ST I C :.- 



NG TYPE: amino acid 
ON ToPOL jGY : linear 
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!xi'i SEQUENCE DESCRIPTION: SEQ ID NO : 5 9 : 

Ala He Pro i'ro His Cys ?he 
i 5 

(2) INFOF:MATION FOR SEQ IE) NO: 60: 

■ i ■> SEQUENCE CHARACTERISTICS: 

',70 LENGTH: 6 amino acils 

R ) TVER: -.mir.o acre! 

.Tii TOPOLOGY: lir.e = r 

■ li; MOLECULE TYPE: pry:ein 



i : x ;■ FEAT r IRE: 

■ A' NAME 'ELY: unsure 

:Ri L 1 i'"AT D >f I : :i 

• [' i OTHER I [JFOF.MA'I l ON : 



Raa = Leu or Pro 



i i :<) FEATURE 
i. A) NAME / KEY 
i h ) L' NATION 



u n s u r e 
4 



; D > OTHER INFOFMATR 



;aa = lie or Leu or Val 



'>:i- FEQREN :E EESCRIPTION: SEQ ID NO:60: 



AA-Ld Tip Xaci R -i d Y': > ' i ri> 



(2) 



IL'FOFTLAT ION FOR SEQ ID NO: 61: 

• : • SEQUENCE CHARACTERISTICS: 

i 7S, LLNOTH : 5 amino acids 

t E- ' TYF-E: "iinir.o acid 

i D ) T' Y-'OLOGY : 1 inear 



His 



MoLECULE TYPE: pro.ein 

SEQUENCE DESCFIPTICN: SEQ ID NO: 61 
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